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SPECIFICATION 

1 . Title of the Invention 

Data compression and decompression method 

2. Scope of Patent Claims 

In a device that conducts Huffman compression 
and decompression of data using a compression table, 
a data compression and decompression method char- 
acterized in that it comprises a sampling means that 
obtains the current frequency of occurrence of char- 
acters and a reading and writing means that reads and 
writes character data in frequency of occurrence or- 
der. 

3. Detailed Description of the Invention 
(Abstract) 

The present invention relates to the compression 
and decompression of data, and in order to improve 
data compression rates, it permits the table contents 
of the hitherto fixed compression table to be updated 
based on an analysis of the data that is the compres- 
sion target, thereby providing the optimum compres- 
sion for each compression target data. 

(Field of Industrial Application) 

The present invention relates to a data compres- 
sion method, specifically Huffman compression. 



Huffman compression is a technique for com- 
pressing character data into even shorter bit data, and 
it is generally used after a large volume of data has 
been character compressed. However, depending on 
the compression target data, the compression rate 
may deteriorate greatly, and a technique is needed to 
prevent this. 

(Prior Art) 

In Huffman compression, a compression table 
that assigns short bit strings in the order of characters 
that are thought to have high frequencies of occur- 
rence is provided and data compression and decom- 
pression is conducted using this table. 

(Problem to Be Solved by the Invention) 

In conventional Huffman compression, charac- 
ters are recorded in the order in which they are 
thought to generally occur most frequently. However, 
because this Huffman table is fixed, the method has 
the disadvantage that compression rates may be 
greatly degraded when the data contains large 
amounts of special characters. 
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(Means for Solving the Problem) 

The Huffman compression method of the present 
invention is configured so as to comprise, in a device 
that conducts Huffman compression and decompres- 
sion of data using a compression table, a sampling 
means that obtains the current frequency of occur- 
rence of characters and a reading and writing means 
that reads and writes character data in frequency of 
occurrence order. 

(Operation) 

Figure 1 shows an example of an embodiment 
according to the present invention. In the figure, HSP 
is a Huffman compression and decompression sec- 
tion, HTBL is a Huffman compression table, FRM is 
a sampling means that obtains characters in fre- 
quency of occurrence order, and R/W is a reading 
and writing means that reads and writes data in fre- 
quency of occurrence order. 

The Huffman compression and decompression 
section HSP comprises an input buffer IBUF that 
reads the data that is the target of compression or 
decompression, an output buffer OBUF that outputs 
the compressed or compressed data, and a conversion 
section HCDP that compresses or decompresses the 
data. The conversion section HCDP references the 
compression table HTBL and converts characters to 
bit strings or bit strings to characters. The compres- 
sion table HTBL stores a character CHR and a corre- 
sponding bit string BIT that constitute a conversion 
pair. The processing above is the portion that is iden- 
tical to conventional compression and decompression 
processing. 

The sampling section FRM that is a feature of 
the present invention comprises a character selection 
section CSEL, a frequency of occurrence counting 
section CTR, a character buffer CBUF that stores 
occurring characters CHRA and the number of their 
occurrences, a sorting section SRT that sorts the char- 
acters in frequency of occurrence order, and a sorting 
buffer SBUF that stores the occurring characters in 
the order of their frequency of occurrence. 

In the case of data compression, the sampling 
section FRM reads the data placed in the input buffer 
IBUF character by character and places each newly 
occurring character in the character field CHAR in 
the character buffer CBUF, and at this time, the 
counting section CTR sets a 1 in the frequency field 
FRQ that corresponds to that field in the character 
buffer CBUF as the frequency of occurrence. There- 
after, whenever a previously occurring character ap- 
pears, the counting section CTR increases the fre- 
quency of occurrence. When the reading of all char- 
acters has been completed, the sorting section SRT 
sorts the characters in the order of the values in the 
frequency filed FRQ and outputs the characters to the 



sorting buffer SBUF. The reading and writing means 
R/W writes the group of characters in frequency of 
occurrence order to the compression table HTBL and 
also writes the compression data to the output buffer 
OBUF. 

Thereafter, Huffman compression processing is 
executed by the conversion section HCDP. 

In the case of data decompression, the reading 
and writing means R/W reads the character string 
data in frequency of occurrence order that has been 
written at the beginning of the data in the input buffer 
IBUF and writes it to the compression table HTBL. 
Thereafter, Huffman decompression processing is 
executed by the conversion section HCDP. 

Figure 2 shows an example of an embodiment 
where the compressed data has been output to a mag- 
netic tape. The group of characters that is stored in 
the Huffman compression table has been written in 
frequency of occurrence order in the data header HD 
and each blocked compressed data has been written 
to DAI, DA2, DA3, ... 

In this embodiment according to the present in- 
vention, the group of characters that is stored in the 
compression table has been written at the beginning 
of the compressed data, but it is not necessary that the 
group of characters that is stored in the compression 
table always be added to the beginning of the com- 
pressed data, and a means that sets the contents of the 
compression table from a separate compression occa- 
sion into the compression table at decompression 
may be used. This method is effective when the con- 
tents of the data change periodically. 

In addition, because the units in which compres- 
sion and decompression is conducted using the same 
compression table are unrelated to the volume of 
data, a completely different compression table may 
be used temporarily in the midst of a single com- 
pressed data. This method is effective when process- 
ing data formed of character group blocks that are 
extremely different. 

(Effect of the Invention) 

The present invention permits optimum Huffman 
compression to be conducted by setting a group of 
characters in frequency of occurrence order in a com- 
pression table for Huffman compression according to 
the data. 

4. Brief Description of the Drawings 

Figure 1 is an example of an embodiment of 
Huffman compression according to the present inven- 
tion. Figure 2 is an example of an embodiment in 
which compressed data has been output to a magnetic 
tape. 

In Figure 1, HSP is a Huffman compression and 
decompression section, HTBL is a Huffman com- 
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pression table, FRM is a sampling means, and R/W is 
a reading and writing means. 



Patent attorney Takuichi IKEDA [seal illegible] 
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